首页> 外文OA文献 >An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
【2h】

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

机译:强化学习在对话策略选择中的应用   在电子邮件的口语对话系统中

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper describes a novel method by which a spoken dialogue system canlearn to choose an optimal dialogue strategy from its experience interactingwith human users. The method is based on a combination of reinforcementlearning and performance modeling of spoken dialogue systems. The reinforcementlearning component applies Q-learning (Watkins, 1989), while the performancemodeling component applies the PARADISE evaluation framework (Walker et al.,1997) to learn the performance function (reward) used in reinforcementlearning. We illustrate the method with a spoken dialogue system named ELVIS(EmaiL Voice Interactive System), that supports access to email over the phone.We conduct a set of experiments for training an optimal dialogue strategy on acorpus of 219 dialogues in which human users interact with ELVIS over thephone. We then test that strategy on a corpus of 18 dialogues. We show thatELVIS can learn to optimize its strategy selection for agent initiative, forreading messages, and for summarizing email folders.
机译:本文介绍了一种新颖的方法,通过该方法,口语对话系统可以从与人类用户交互的经验中学习选择最佳对话策略。该方法基于语音对话系统的强化学习和性能建模的组合。强化学习部分应用Q学习(Watkins,1989),而绩效建模部分应用PARADISE评估框架(Walker等,1997)来学习强化学习中使用的绩效函数(奖励)。我们使用名为ELVIS(EmaiL语音交互系统)的口语对话系统来说明该方法,该系统支持通过电话访问电子邮件。我们进行了一组实验,以针对与人类用户互动的219种对话进行训练最佳对话策略。 ELVIS通过电话。然后,我们在18个对话的语料库中测试该策略。我们表明,ELVIS可以学习优化其策略选择,以实现座席主动性,阅读消息以及汇总电子邮件文件夹。

著录项

  • 作者

    Walker, M. A.;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号